Picture for Anas Barakat

Anas Barakat

S2A, IDS, LTCI

Online Learning on Hidden-Convex Losses via Algorithmic Equivalence: Optimal Regret, Geometric Barrier, and Bandit Feedback

Add code
May 25, 2026
Viaarxiv icon

When and Why is Optimistic Multiplicative Weights Slow? The Geometry of Energy Dissipation

Add code
May 13, 2026
Viaarxiv icon

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

Add code
Feb 26, 2026
Viaarxiv icon

Convex Markov Games and Beyond: New Proof of Existence, Characterization and Learning Algorithms for Nash Equilibria

Add code
Feb 12, 2026
Viaarxiv icon

Multi-Agent Online Control with Adversarial Disturbances

Add code
Jun 23, 2025
Viaarxiv icon

Optimistic Online Learning in Symmetric Cone Games

Add code
Apr 04, 2025
Figure 1 for Optimistic Online Learning in Symmetric Cone Games
Figure 2 for Optimistic Online Learning in Symmetric Cone Games
Viaarxiv icon

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

Add code
Oct 05, 2024
Figure 1 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 2 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 3 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 4 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Viaarxiv icon

Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning

Add code
Oct 03, 2024
Figure 1 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Figure 2 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Figure 3 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Figure 4 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Viaarxiv icon

Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players

Add code
Aug 15, 2024
Figure 1 for Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players
Viaarxiv icon

Policy Mirror Descent with Lookahead

Add code
Mar 21, 2024
Figure 1 for Policy Mirror Descent with Lookahead
Figure 2 for Policy Mirror Descent with Lookahead
Figure 3 for Policy Mirror Descent with Lookahead
Viaarxiv icon